Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 59354 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.2 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 9 |
Price is highly correlated with Area | High correlation |
Number of rooms is highly correlated with Area | High correlation |
Area is highly correlated with Price and 1 other fields | High correlation |
Garden Area is highly correlated with Surface of the land | High correlation |
Surface of the land is highly correlated with Garden Area | High correlation |
Price is highly correlated with Area and 1 other fields | High correlation |
Number of rooms is highly correlated with Area and 1 other fields | High correlation |
Area is highly correlated with Price and 2 other fields | High correlation |
Garden Area is highly correlated with Surface of the land | High correlation |
Surface of the land is highly correlated with Price and 3 other fields | High correlation |
Number of rooms is highly correlated with Area and 1 other fields | High correlation |
Area is highly correlated with Number of rooms and 1 other fields | High correlation |
Surface of the land is highly correlated with Number of rooms and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with Type of property | High correlation |
Surface of the land is highly correlated with Garden Area | High correlation |
Locality is highly correlated with Province and 1 other fields | High correlation |
Area is highly correlated with Number of rooms | High correlation |
Number of rooms is highly correlated with Area | High correlation |
Garden Area is highly correlated with Surface of the land | High correlation |
Province is highly correlated with Locality and 1 other fields | High correlation |
Region is highly correlated with Locality and 1 other fields | High correlation |
Type of property is highly correlated with Unnamed: 0 | High correlation |
Province is highly correlated with Region | High correlation |
Region is highly correlated with Province | High correlation |
Number of rooms is highly skewed (γ1 = 22.35695618) | Skewed |
Terrace Area is highly skewed (γ1 = 83.44226517) | Skewed |
Garden Area is highly skewed (γ1 = 187.3438513) | Skewed |
Surface of the land is highly skewed (γ1 = 185.9877999) | Skewed |
PriceperMeter is highly skewed (γ1 = 113.6810042) | Skewed |
Unnamed: 0 has unique values | Unique |
Number of rooms has 685 (1.2%) zeros | Zeros |
Terrace Area has 37451 (63.1%) zeros | Zeros |
Garden Area has 49295 (83.1%) zeros | Zeros |
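As a rough illustration of what the Zeros and Skewed alerts above measure, the following Python sketch recomputes the zero percentages from the reported counts and applies one common skewness formula to made-up data. The helper names `zero_share` and `skewness` and the `toy` list are assumptions for illustration only; the report's own γ1 estimator may include a bias correction, so its values need not match this formula exactly.

```python
def zero_share(zero_count: int, n_rows: int) -> float:
    """Percentage of rows equal to zero, rounded to one decimal place."""
    return round(100 * zero_count / n_rows, 1)


def skewness(values):
    """Population skewness m3 / m2**1.5 (no bias correction applied)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


N_ROWS = 59354  # number of observations reported in the dataset statistics

# Zero counts copied from the alerts above.
print(zero_share(685, N_ROWS))    # Number of rooms
print(zero_share(37451, N_ROWS))  # Terrace Area
print(zero_share(49295, N_ROWS))  # Garden Area

# Hypothetical data: mostly zeros plus a few large values gives a long
# right tail, hence a positive skewness.
toy = [0] * 95 + [10, 20, 50, 100, 1000]
print(skewness(toy) > 0)
```

The three percentages should agree with the 1.2%, 63.1% and 83.1% shown in the alerts.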
Reproduction
| Analysis started | 2021-05-31 14:08:56.479474 |
|---|---|
| Analysis finished | 2021-05-31 14:09:08.670319 |
| Duration | 12.19 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ≥0)
| Distinct | 59354 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29897.7879 |
| Minimum | 0 |
|---|---|
| Maximum | 59615 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2967.65 |
| Q1 | 15100.25 |
| median | 29938.5 |
| Q3 | 44776.75 |
| 95-th percentile | 56647.35 |
| Maximum | 59615 |
| Range | 59615 |
| Interquartile range (IQR) | 29676.5 |
Descriptive statistics
| Standard deviation | 17193.88567 |
|---|---|
| Coefficient of variation (CV) | 0.5750888904 |
| Kurtosis | -1.194182304 |
| Mean | 29897.7879 |
| Median Absolute Deviation (MAD) | 14838.5 |
| Skewness | -0.008322199244 |
| Sum | 1774553303 |
| Variance | 295629704.4 |
| Monotonicity | Strictly increasing |
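Several of the figures above are derived from one another, which makes for a quick consistency check. The sketch below is a minimal illustration that assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum, sum = mean × number of rows); the variable names are chosen for this example and are not part of the report.

```python
import math

# Figures copied from the tables above.
mean = 29897.7879
std = 17193.88567
q1, q3 = 15100.25, 44776.75
minimum, maximum = 0, 59615
n_rows = 59354

# Derived quantities under the standard definitions.
cv = std / mean
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum
total = mean * n_rows

# Compare with the reported values; the tolerance absorbs rounding of inputs.
print(math.isclose(cv, 0.5750888904, rel_tol=1e-6))
print(math.isclose(variance, 295629704.4, rel_tol=1e-6))
print(iqr == 29676.5)
print(value_range == 59615)
print(math.isclose(total, 1774553303, rel_tol=1e-6))
```

Each line should print `True` if the reported statistics are mutually consistent.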
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 38200 | 1 | < 0.1% |
| 34106 | 1 | < 0.1% |
| 36155 | 1 | < 0.1% |
| 46396 | 1 | < 0.1% |
| 48445 | 1 | < 0.1% |
| 42302 | 1 | < 0.1% |
| 44351 | 1 | < 0.1% |
| 21856 | 1 | < 0.1% |
| 23905 | 1 | < 0.1% |
| Other values (59344) | 59344 | > 99.9% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59615 | 1 | < 0.1% |
| 59614 | 1 | < 0.1% |
| 59613 | 1 | < 0.1% |
| 59612 | 1 | < 0.1% |
| 59611 | 1 | < 0.1% |
| 59610 | 1 | < 0.1% |
| 59609 | 1 | < 0.1% |
| 59608 | 1 | < 0.1% |
| 59607 | 1 | < 0.1% |
| 59606 | 1 | < 0.1% |
Locality
Real number (ℝ≥0)
| Distinct | 972 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5303.593288 |
| Minimum | 1000 |
|---|---|
| Maximum | 9991 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1060 |
| Q1 | 2200 |
| median | 4960 |
| Q3 | 8430 |
| 95-th percentile | 9340 |
| Maximum | 9991 |
| Range | 8991 |
| Interquartile range (IQR) | 6230 |
Descriptive statistics
| Standard deviation | 3075.732929 |
|---|---|
| Coefficient of variation (CV) | 0.579933785 |
| Kurtosis | -1.595137598 |
| Mean | 5303.593288 |
| Median Absolute Deviation (MAD) | 3260 |
| Skewness | -0.02947427814 |
| Sum | 314789476 |
| Variance | 9460133.052 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1050 | 1131 | 1.9% |
| 8400 | 1119 | 1.9% |
| 1180 | 1082 | 1.8% |
| 9000 | 1070 | 1.8% |
| 8300 | 1065 | 1.8% |
| 2000 | 975 | 1.6% |
| 1000 | 877 | 1.5% |
| 4000 | 812 | 1.4% |
| 8000 | 665 | 1.1% |
| 8370 | 658 | 1.1% |
| Other values (962) | 49900 | 84.1% |
| Value | Count | Frequency (%) |
| 1000 | 877 | 1.5% |
| 1020 | 110 | 0.2% |
| 1030 | 319 | 0.5% |
| 1040 | 370 | 0.6% |
| 1050 | 1131 | 1.9% |
| 1060 | 185 | 0.3% |
| 1070 | 433 | 0.7% |
| 1080 | 266 | 0.4% |
| 1081 | 105 | 0.2% |
| 1082 | 75 | 0.1% |
| Value | Count | Frequency (%) |
| 9991 | 60 | 0.1% |
| 9990 | 46 | 0.1% |
| 9988 | 4 | < 0.1% |
| 9982 | 1 | < 0.1% |
| 9980 | 4 | < 0.1% |
| 9971 | 15 | < 0.1% |
| 9970 | 3 | < 0.1% |
| 9968 | 9 | < 0.1% |
| 9961 | 1 | < 0.1% |
| 9960 | 6 | < 0.1% |
Type of property
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| apartment | 33341 |
|---|---|
| house | 26013 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 7.246925228 |
| Min length | 5 |
Characters and Unicode
| Total characters | 430134 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | apartment |
|---|---|
| 2nd row | apartment |
| 3rd row | apartment |
| 4th row | apartment |
| 5th row | apartment |
Common Values
| Value | Count | Frequency (%) |
| apartment | 33341 | 56.2% |
| house | 26013 | 43.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| apartment | 33341 | 56.2% |
| house | 26013 | 43.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 66682 | |
| t | 66682 | |
| e | 59354 | |
| p | 33341 | |
| r | 33341 | |
| m | 33341 | |
| n | 33341 | |
| h | 26013 | 6.0% |
| o | 26013 | 6.0% |
| u | 26013 | 6.0% |
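The character counts above follow directly from the category counts, since each category label contributes its letters once per row. The sketch below is a minimal illustration of that bookkeeping; it assumes the counts of 33341 apartments and 26013 houses from the Common Values table, and the variable names are chosen for this example.

```python
from collections import Counter

# Category counts taken from the Common Values table.
category_counts = {"apartment": 33341, "house": 26013}

# Each occurrence of a label adds one count for every character in it.
char_counts = Counter()
for label, count in category_counts.items():
    for ch in label:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
n_rows = sum(category_counts.values())

print(total_chars)                     # total characters
print(round(total_chars / n_rows, 4))  # mean label length
for ch, cnt in char_counts.most_common(3):
    print(ch, cnt, f"{100 * cnt / total_chars:.1f}%")
```

The total and the per-character counts should match the "Total characters" and "Most occurring characters" figures in this section.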
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 430134 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 66682 | |
| t | 66682 | |
| e | 59354 | |
| p | 33341 | |
| r | 33341 | |
| m | 33341 | |
| n | 33341 | |
| h | 26013 | 6.0% |
| o | 26013 | 6.0% |
| u | 26013 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 430134 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 66682 | |
| t | 66682 | |
| e | 59354 | |
| p | 33341 | |
| r | 33341 | |
| m | 33341 | |
| n | 33341 | |
| h | 26013 | 6.0% |
| o | 26013 | 6.0% |
| u | 26013 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 430134 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 66682 | |
| t | 66682 | |
| e | 59354 | |
| p | 33341 | |
| r | 33341 | |
| m | 33341 | |
| n | 33341 | |
| h | 26013 | 6.0% |
| o | 26013 | 6.0% |
| u | 26013 | 6.0% |
Price
Real number (ℝ≥0)
| Distinct | 7344 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 370636.9388 |
| Minimum | 33500 |
|---|---|
| Maximum | 9876543 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 33500 |
|---|---|
| 5-th percentile | 126999.65 |
| Q1 | 209999 |
| median | 276658 |
| Q3 | 385000 |
| 95-th percentile | 894999 |
| Maximum | 9876543 |
| Range | 9843043 |
| Interquartile range (IQR) | 175001 |
Descriptive statistics
| Standard deviation | 382064.1726 |
|---|---|
| Coefficient of variation (CV) | 1.030831341 |
| Kurtosis | 59.8741589 |
| Mean | 370636.9388 |
| Median Absolute Deviation (MAD) | 81658 |
| Skewness | 6.068615237 |
| Sum | 2.199878487 × 10¹⁰ |
| Variance | 1.45973032 × 10¹¹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 249000 | 620 | 1.0% |
| 299000 | 591 | 1.0% |
| 295000 | 554 | 0.9% |
| 225000 | 525 | 0.9% |
| 275000 | 486 | 0.8% |
| 199000 | 485 | 0.8% |
| 235000 | 479 | 0.8% |
| 265000 | 419 | 0.7% |
| 325000 | 418 | 0.7% |
| 285000 | 413 | 0.7% |
| Other values (7334) | 54364 | 91.6% |
| Value | Count | Frequency (%) |
| 33500 | 1 | < 0.1% |
| 34000 | 1 | < 0.1% |
| 34999 | 3 | < 0.1% |
| 35000 | 4 | < 0.1% |
| 37000 | 1 | < 0.1% |
| 37999 | 1 | < 0.1% |
| 39000 | 2 | < 0.1% |
| 39900 | 1 | < 0.1% |
| 39999 | 4 | < 0.1% |
| 40000 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 9876543 | 1 | < 0.1% |
| 9500000 | 1 | < 0.1% |
| 8500000 | 1 | < 0.1% |
| 6500000 | 2 | < 0.1% |
| 6499999 | 1 | < 0.1% |
| 6399999 | 1 | < 0.1% |
| 5950000 | 5 | < 0.1% |
| 5949999 | 3 | < 0.1% |
| 5849999 | 1 | < 0.1% |
| 5449999 | 1 | < 0.1% |
Number of rooms
Real number (ℝ≥0)
HIGH CORRELATION ×4, SKEWED, ZEROS
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.686120565 |
| Minimum | 0 |
|---|---|
| Maximum | 165 |
| Zeros | 685 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2.6 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 165 |
| Range | 165 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.554511204 |
|---|---|
| Coefficient of variation (CV) | 0.5787198177 |
| Kurtosis | 2049.588682 |
| Mean | 2.686120565 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 22.35695618 |
| Sum | 159432 |
| Variance | 2.416505082 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 21823 | 36.8% |
| 3 | 18316 | 30.9% |
| 1 | 7016 | 11.8% |
| 4 | 6664 | 11.2% |
| 5 | 2473 | 4.2% |
| 6 | 1000 | 1.7% |
| 0 | 685 | 1.2% |
| 2.6 | 590 | 1.0% |
| 7 | 319 | 0.5% |
| 8 | 207 | 0.3% |
| Other values (22) | 261 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 685 | 1.2% |
| 1 | 7016 | 11.8% |
| 2 | 21823 | 36.8% |
| 2.6 | 590 | 1.0% |
| 3 | 18316 | 30.9% |
| 4 | 6664 | 11.2% |
| 5 | 2473 | 4.2% |
| 6 | 1000 | 1.7% |
| 7 | 319 | 0.5% |
| 8 | 207 | 0.3% |
| Value | Count | Frequency (%) |
| 165 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 34 | 2 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 3 | < 0.1% |
| 25 | 3 | < 0.1% |
| 24 | 3 | < 0.1% |
| 22 | 1 | < 0.1% |
Area
Real number (ℝ≥0)
| Distinct | 771 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149.5463153 |
| Minimum | 1 |
|---|---|
| Maximum | 11366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 88 |
| median | 119 |
| Q3 | 170 |
| 95-th percentile | 344 |
| Maximum | 11366 |
| Range | 11365 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 130.3195885 |
|---|---|
| Coefficient of variation (CV) | 0.8714329617 |
| Kurtosis | 1022.147654 |
| Mean | 149.5463153 |
| Median Absolute Deviation (MAD) | 37 |
| Skewness | 16.55504415 |
| Sum | 8876172 |
| Variance | 16983.19514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 1201 | 2.0% |
| 90 | 1136 | 1.9% |
| 120 | 1017 | 1.7% |
| 110 | 1003 | 1.7% |
| 80 | 983 | 1.7% |
| 150 | 927 | 1.6% |
| 85 | 901 | 1.5% |
| 140 | 866 | 1.5% |
| 130 | 836 | 1.4% |
| 95 | 833 | 1.4% |
| Other values (761) | 49651 | 83.7% |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 3 | < 0.1% |
| 14 | 1 | < 0.1% |
| 15 | 16 | < 0.1% |
| 16 | 30 | 0.1% |
| 17 | 26 | < 0.1% |
| Value | Count | Frequency (%) |
| 11366 | 1 | < 0.1% |
| 4000 | 3 | < 0.1% |
| 3989 | 1 | < 0.1% |
| 3621 | 1 | < 0.1% |
| 3600 | 1 | < 0.1% |
| 2880 | 1 | < 0.1% |
| 2600 | 1 | < 0.1% |
| 2500 | 4 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 1925 | 2 | < 0.1% |
Fully equipped kitchen
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| 0.0 | 51127 |
|---|---|
| 1.0 | 8227 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 178062 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 51127 | 86.1% |
| 1.0 | 8227 | 13.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 51127 | 86.1% |
| 1.0 | 8227 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 110481 | |
| . | 59354 | |
| 1 | 8227 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 118708 | |
| Other Punctuation | 59354 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 110481 | |
| 1 | 8227 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178062 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 110481 | |
| . | 59354 | |
| 1 | 8227 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 110481 | |
| . | 59354 | |
| 1 | 8227 | 4.6% |
Furnished
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| 0.0 | 57208 |
|---|---|
| 1.0 | 2146 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 178062 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 57208 | 96.4% |
| 1.0 | 2146 | 3.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 57208 | 96.4% |
| 1.0 | 2146 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 116562 | |
| . | 59354 | |
| 1 | 2146 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 118708 | |
| Other Punctuation | 59354 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116562 | |
| 1 | 2146 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178062 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 116562 | |
| . | 59354 | |
| 1 | 2146 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 116562 | |
| . | 59354 | |
| 1 | 2146 | 1.2% |
Open fire
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| 0.0 | 57017 |
|---|---|
| 1.0 | 2337 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 178062 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 57017 | 96.1% |
| 1.0 | 2337 | 3.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 57017 | 96.1% |
| 1.0 | 2337 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 116371 | |
| . | 59354 | |
| 1 | 2337 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 118708 | |
| Other Punctuation | 59354 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116371 | |
| 1 | 2337 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178062 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 116371 | |
| . | 59354 | |
| 1 | 2337 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 116371 | |
| . | 59354 | |
| 1 | 2337 | 1.3% |
Terrace Area
Real number (ℝ≥0)
| Distinct | 304 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.63459918 |
| Minimum | 0 |
|---|---|
| Maximum | 9636 |
| Zeros | 37451 |
| Zeros (%) | 63.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 11 |
| 95-th percentile | 44.35 |
| Maximum | 9636 |
| Range | 9636 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 62.09448707 |
|---|---|
| Coefficient of variation (CV) | 5.838911838 |
| Kurtosis | 10898.802 |
| Mean | 10.63459918 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 83.44226517 |
| Sum | 631206 |
| Variance | 3855.725325 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37451 | 63.1% |
| 10 | 1402 | 2.4% |
| 20 | 1077 | 1.8% |
| 12 | 1040 | 1.8% |
| 15 | 1028 | 1.7% |
| 8 | 1001 | 1.7% |
| 6 | 906 | 1.5% |
| 9 | 896 | 1.5% |
| 14 | 722 | 1.2% |
| 7 | 694 | 1.2% |
| Other values (294) | 13137 | 22.1% |
| Value | Count | Frequency (%) |
| 0 | 37451 | 63.1% |
| 1 | 81 | 0.1% |
| 2 | 316 | 0.5% |
| 3 | 365 | 0.6% |
| 4 | 622 | 1.0% |
| 5 | 618 | 1.0% |
| 6 | 906 | 1.5% |
| 7 | 694 | 1.2% |
| 8 | 1001 | 1.7% |
| 9 | 896 | 1.5% |
| Value | Count | Frequency (%) |
| 9636 | 1 | < 0.1% |
| 4547 | 1 | < 0.1% |
| 4000 | 1 | < 0.1% |
| 3800 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 1958 | 1 | < 0.1% |
| 1890 | 1 | < 0.1% |
| 1613 | 1 | < 0.1% |
| 1500 | 3 | < 0.1% |
| 1463 | 1 | < 0.1% |
Garden Area
Real number (ℝ≥0)
| Distinct | 1163 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 147.9400714 |
| Minimum | 0 |
|---|---|
| Maximum | 950002 |
| Zeros | 49295 |
| Zeros (%) | 83.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 366.7 |
| Maximum | 950002 |
| Range | 950002 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4291.863315 |
|---|---|
| Coefficient of variation (CV) | 29.01082359 |
| Kurtosis | 40617.19413 |
| Mean | 147.9400714 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 187.3438513 |
| Sum | 8780835 |
| Variance | 18420090.71 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 49295 | 83.1% |
| 100 | 294 | 0.5% |
| 200 | 252 | 0.4% |
| 50 | 232 | 0.4% |
| 300 | 201 | 0.3% |
| 150 | 188 | 0.3% |
| 500 | 163 | 0.3% |
| 40 | 154 | 0.3% |
| 60 | 152 | 0.3% |
| 30 | 151 | 0.3% |
| Other values (1153) | 8272 | 13.9% |
| Value | Count | Frequency (%) |
| 0 | 49295 | 83.1% |
| 1 | 108 | 0.2% |
| 2 | 1 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 8 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 13 | < 0.1% |
| 7 | 14 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 950002 | 1 | < 0.1% |
| 235000 | 1 | < 0.1% |
| 150000 | 1 | < 0.1% |
| 100000 | 1 | < 0.1% |
| 80978 | 1 | < 0.1% |
| 80000 | 3 | < 0.1% |
| 75000 | 1 | < 0.1% |
| 65000 | 2 | < 0.1% |
| 54515 | 1 | < 0.1% |
| 54000 | 1 | < 0.1% |
Surface of the land
Real number (ℝ≥0)
HIGH CORRELATION ×4, SKEWED
| Distinct | 2065 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 308.1209859 |
| Minimum | 1 |
|---|---|
| Maximum | 950852 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 58 |
| Q1 | 95 |
| median | 135 |
| Q3 | 213 |
| 95-th percentile | 668 |
| Maximum | 950852 |
| Range | 950851 |
| Interquartile range (IQR) | 118 |
Descriptive statistics
| Standard deviation | 4305.753659 |
|---|---|
| Coefficient of variation (CV) | 13.97423043 |
| Kurtosis | 40213.19274 |
| Mean | 308.1209859 |
| Median Absolute Deviation (MAD) | 48 |
| Skewness | 185.9877999 |
| Sum | 18288213 |
| Variance | 18539514.57 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 858 | 1.4% |
| 90 | 770 | 1.3% |
| 120 | 736 | 1.2% |
| 110 | 723 | 1.2% |
| 80 | 691 | 1.2% |
| 150 | 665 | 1.1% |
| 140 | 655 | 1.1% |
| 130 | 648 | 1.1% |
| 105 | 634 | 1.1% |
| 95 | 612 | 1.0% |
| Other values (2055) | 52362 | 88.2% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 15 | 15 | < 0.1% |
| 16 | 25 | < 0.1% |
| 17 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 950852 | 1 | < 0.1% |
| 235240 | 1 | < 0.1% |
| 151340 | 1 | < 0.1% |
| 100220 | 1 | < 0.1% |
| 83228 | 1 | < 0.1% |
| 80245 | 1 | < 0.1% |
| 80216 | 1 | < 0.1% |
| 80195 | 1 | < 0.1% |
| 75830 | 1 | < 0.1% |
| 66500 | 1 | < 0.1% |
Number of facades
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| 2.0 | 44306 |
|---|---|
| 4.0 | 7672 |
| 3.0 | 7015 |
| 1.0 | 361 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 178062 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 4.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 44306 | 74.6% |
| 4.0 | 7672 | 12.9% |
| 3.0 | 7015 | 11.8% |
| 1.0 | 361 | 0.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2.0 | 44306 | 74.6% |
| 4.0 | 7672 | 12.9% |
| 3.0 | 7015 | 11.8% |
| 1.0 | 361 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 59354 | |
| 0 | 59354 | |
| 2 | 44306 | |
| 4 | 7672 | 4.3% |
| 3 | 7015 | 3.9% |
| 1 | 361 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 118708 | |
| Other Punctuation | 59354 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 59354 | |
| 2 | 44306 | |
| 4 | 7672 | 6.5% |
| 3 | 7015 | 5.9% |
| 1 | 361 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178062 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 59354 | |
| 0 | 59354 | |
| 2 | 44306 | |
| 4 | 7672 | 4.3% |
| 3 | 7015 | 3.9% |
| 1 | 361 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 59354 | |
| 0 | 59354 | |
| 2 | 44306 | |
| 4 | 7672 | 4.3% |
| 3 | 7015 | 3.9% |
| 1 | 361 | 0.2% |
Swimming pool
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| 0.0 | 58381 |
|---|---|
| 1.0 | 973 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 178062 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 58381 | 98.4% |
| 1.0 | 973 | 1.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 58381 | 98.4% |
| 1.0 | 973 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 117735 | |
| . | 59354 | |
| 1 | 973 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 118708 | |
| Other Punctuation | 59354 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 117735 | |
| 1 | 973 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59354 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 178062 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 117735 | |
| . | 59354 | |
| 1 | 973 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 117735 | |
| . | 59354 | |
| 1 | 973 | 0.5% |
State of the building
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| good | 27056 |
|---|---|
| medium | 25699 |
| to renovate | 5349 |
| new | 1250 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 5.475738788 |
| Min length | 3 |
Characters and Unicode
| Total characters | 325007 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | medium |
|---|---|
| 2nd row | medium |
| 3rd row | medium |
| 4th row | medium |
| 5th row | medium |
Common Values
| Value | Count | Frequency (%) |
| good | 27056 | 45.6% |
| medium | 25699 | 43.3% |
| to renovate | 5349 | 9.0% |
| new | 1250 | 2.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| good | 27056 | 41.8% |
| medium | 25699 | 39.7% |
| renovate | 5349 | 8.3% |
| to | 5349 | 8.3% |
| new | 1250 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 64810 | |
| d | 52755 | |
| m | 51398 | |
| e | 37647 | |
| g | 27056 | |
| i | 25699 | 7.9% |
| u | 25699 | 7.9% |
| t | 10698 | 3.3% |
| n | 6599 | 2.0% |
| (space) | 5349 | 1.6% |
| Other values (4) | 17297 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 319658 | |
| Space Separator | 5349 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 64810 | |
| d | 52755 | |
| m | 51398 | |
| e | 37647 | |
| g | 27056 | |
| i | 25699 | 8.0% |
| u | 25699 | 8.0% |
| t | 10698 | 3.3% |
| n | 6599 | 2.1% |
| r | 5349 | 1.7% |
| Other values (3) | 11948 | 3.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5349 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 319658 | |
| Common | 5349 | 1.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 64810 | |
| d | 52755 | |
| m | 51398 | |
| e | 37647 | |
| g | 27056 | |
| i | 25699 | 8.0% |
| u | 25699 | 8.0% |
| t | 10698 | 3.3% |
| n | 6599 | 2.1% |
| r | 5349 | 1.7% |
| Other values (3) | 11948 | 3.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 5349 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 325007 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 64810 | |
| d | 52755 | |
| m | 51398 | |
| e | 37647 | |
| g | 27056 | |
| i | 25699 | 7.9% |
| u | 25699 | 7.9% |
| t | 10698 | 3.3% |
| n | 6599 | 2.0% |
| (space) | 5349 | 1.6% |
| Other values (4) | 17297 | 5.3% |
Province
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
| Flandre Occidental | 12626 |
|---|---|
| Flandre Oriental | 7352 |
| Hainaut | 7096 |
| Brussel | 6982 |
| Anvers | 6903 |
| Other values (6) | 18395 |
Length
| Max length | 18 |
|---|---|
| Median length | 8 |
| Mean length | 11.14202918 |
| Min length | 5 |
Characters and Unicode
| Total characters | 661324 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Anvers |
|---|---|
| 2nd row | Brabant Flamand |
| 3rd row | Flandre Occidental |
| 4th row | Anvers |
| 5th row | Anvers |
Common Values
| Value | Count | Frequency (%) |
| Flandre Occidental | 12626 | 21.3% |
| Flandre Oriental | 7352 | 12.4% |
| Hainaut | 7096 | 12.0% |
| Brussel | 6982 | 11.8% |
| Anvers | 6903 | 11.6% |
| Liège | 5652 | 9.5% |
| Brabant Flamand | 4592 | 7.7% |
| Brabant Wallon | 2838 | 4.8% |
| Limbourg | 2756 | 4.6% |
| Namur | 1606 | 2.7% |
Length
| Value | Count | Frequency (%) |
| flandre | 19978 | 23.0% |
| occidental | 12626 | 14.6% |
| brabant | 7430 | 8.6% |
| oriental | 7352 | 8.5% |
| hainaut | 7096 | 8.2% |
| brussel | 6982 | 8.0% |
| anvers | 6903 | 8.0% |
| liège | 5652 | 6.5% |
| flamand | 4592 | 5.3% |
| wallon | 2838 | 3.3% |
| Other values (3) | 5313 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 82636 | |
| n | 68815 | 10.4% |
| e | 60444 | 9.1% |
| l | 57206 | 8.7% |
| r | 53958 | 8.2% |
| d | 37196 | 5.6% |
| i | 35482 | 5.4% |
| t | 34504 | 5.2% |
| (space) | 27408 | 4.1% |
| c | 25252 | 3.8% |
| Other values (17) | 178423 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 547154 | |
| Uppercase Letter | 86762 | 13.1% |
| Space Separator | 27408 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 82636 | |
| n | 68815 | |
| e | 60444 | |
| l | 57206 | |
| r | 53958 | |
| d | 37196 | |
| i | 35482 | |
| t | 34504 | |
| c | 25252 | 4.6% |
| s | 20867 | 3.8% |
| Other values (8) | 70794 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 24570 | |
| O | 19978 | |
| B | 14412 | |
| L | 9359 | 10.8% |
| H | 7096 | 8.2% |
| A | 6903 | 8.0% |
| W | 2838 | 3.3% |
| N | 1606 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27408 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 633916 | |
| Common | 27408 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 82636 | |
| n | 68815 | |
| e | 60444 | 9.5% |
| l | 57206 | 9.0% |
| r | 53958 | 8.5% |
| d | 37196 | 5.9% |
| i | 35482 | 5.6% |
| t | 34504 | 5.4% |
| c | 25252 | 4.0% |
| F | 24570 | 3.9% |
| Other values (16) | 153853 |
Common
| Value | Count | Frequency (%) |
| (space) | 27408 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 655672 | |
| Latin 1 Sup | 5652 | 0.9% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 82636 | |
| n | 68815 | 10.5% |
| e | 60444 | 9.2% |
| l | 57206 | 8.7% |
| r | 53958 | 8.2% |
| d | 37196 | 5.7% |
| i | 35482 | 5.4% |
| t | 34504 | 5.3% |
| (space) | 27408 | 4.2% |
| c | 25252 | 3.9% |
| Other values (16) | 172771 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| è | 5652 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 463.8 KiB |
Categories (bar chart): Flanders, Wallonia, Brussel
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.882366816 |
| Min length | 7 |
Characters and Unicode
| Total characters | 467850 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Flanders |
|---|---|
| 2nd row | Flanders |
| 3rd row | Flanders |
| 4th row | Flanders |
| 5th row | Flanders |
Common Values
| Value | Count | Frequency (%) |
| Flanders | 34229 | 57.7% |
| Wallonia | 18143 | 30.6% |
| Brussel | 6982 | 11.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| flanders | 34229 | 57.7% |
| wallonia | 18143 | 30.6% |
| brussel | 6982 | 11.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 77497 | |
| a | 70515 | |
| n | 52372 | |
| s | 48193 | |
| e | 41211 | |
| r | 41211 | |
| F | 34229 | |
| d | 34229 | |
| W | 18143 | 3.9% |
| o | 18143 | 3.9% |
| Other values (3) | 32107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 408496 | |
| Uppercase Letter | 59354 | 12.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 77497 | |
| a | 70515 | |
| n | 52372 | |
| s | 48193 | |
| e | 41211 | |
| r | 41211 | |
| d | 34229 | |
| o | 18143 | 4.4% |
| i | 18143 | 4.4% |
| u | 6982 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 34229 | 57.7% |
| W | 18143 | 30.6% |
| B | 6982 | 11.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 467850 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 77497 | |
| a | 70515 | |
| n | 52372 | |
| s | 48193 | |
| e | 41211 | |
| r | 41211 | |
| F | 34229 | |
| d | 34229 | |
| W | 18143 | 3.9% |
| o | 18143 | 3.9% |
| Other values (3) | 32107 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 467850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 77497 | |
| a | 70515 | |
| n | 52372 | |
| s | 48193 | |
| e | 41211 | |
| r | 41211 | |
| F | 34229 | |
| d | 34229 | |
| W | 18143 | 3.9% |
| o | 18143 | 3.9% |
| Other values (3) | 32107 |
| Distinct | 5852 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2709.809903 |
| Minimum | 109 |
|---|---|
| Maximum | 562499 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 463.8 KiB |
Quantile statistics
| Minimum | 109 |
|---|---|
| 5-th percentile | 1000 |
| Q1 | 1804 |
| median | 2441 |
| Q3 | 3149 |
| 95-th percentile | 5132 |
| Maximum | 562499 |
| Range | 562390 |
| Interquartile range (IQR) | 1345 |
Descriptive statistics
| Standard deviation | 3121.018553 |
|---|---|
| Coefficient of variation (CV) | 1.151748154 |
| Kurtosis | 18820.27894 |
| Mean | 2709.809903 |
| Median Absolute Deviation (MAD) | 668 |
| Skewness | 113.6810042 |
| Sum | 160838057 |
| Variance | 9740756.808 |
| Monotonicity | Not monotonic |
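Several of the figures above are derived from one another, which gives a quick consistency check on the report. The sketch below recomputes the coefficient of variation, interquartile range, range, and variance from the values printed in the tables; the variable names are illustrative and not part of the report.

```python
# Values copied from the quantile and descriptive statistics tables above.
mean = 2709.809903
std_dev = 3121.018553
q1, q3 = 1804, 3149
minimum, maximum = 109, 562499

cv = std_dev / mean              # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std_dev ** 2          # variance is the squared standard deviation

print(f"CV={cv:.6f}  IQR={iqr}  range={value_range}  variance={variance:.3f}")
```

A CV above 1 together with the very large skewness and kurtosis suggests that a handful of extreme values, such as the maximum of 562499, dominate the spread.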
| Value | Count | Frequency (%) |
| 2500 | 243 | 0.4% |
| 3000 | 169 | 0.3% |
| 2000 | 131 | 0.2% |
| 1666 | 122 | 0.2% |
| 2499 | 113 | 0.2% |
| 1500 | 108 | 0.2% |
| 2333 | 100 | 0.2% |
| 2142 | 86 | 0.1% |
| 2250 | 86 | 0.1% |
| 2600 | 79 | 0.1% |
| Other values (5842) | 58117 |
| Value | Count | Frequency (%) |
| 109 | 1 | |
| 113 | 1 | |
| 166 | 2 | |
| 167 | 1 | |
| 189 | 1 | |
| 190 | 1 | |
| 211 | 1 | |
| 214 | 1 | |
| 220 | 1 | |
| 230 | 1 |
| Value | Count | Frequency (%) |
| 562499 | 1 | |
| 294999 | 1 | |
| 143000 | 1 | |
| 99999 | 1 | |
| 48536 | 1 | |
| 41249 | 1 | |
| 37312 | 1 | |
| 37142 | 1 | |
| 34999 | 1 | |
| 30813 | 2 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
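As a minimal sketch of that calculation, the function below divides the sample covariance by the product of the sample standard deviations. The helper name `pearson_r` is hypothetical, and the five Area and Price values are taken from the first rows of this dataset purely for illustration; this is not how the report itself computes the matrix.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return cov / (sd_x * sd_y)

# Area and Price of the first five listings in the dataset.
area = [153.0, 80.0, 90.0, 87.0, 95.0]
price = [764999.0, 294999.0, 233999.0, 329899.0, 359899.0]

r = pearson_r(area, price)

# Shifting and (positively) rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([a * 10.764 + 5 for a in area], [p / 1000 for p in price])

print(r, r_rescaled)
```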
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
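The sketch below illustrates this definition: both variables are replaced by their ranks and Pearson's formula is applied to the ranks. The helper names are hypothetical, and giving tied values the average of their ranks is an assumption that the text above does not spell out.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship (y = x cubed).
xs = [1, 2, 3, 4]
ys = [1, 8, 27, 64]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this example ρ reaches 1 because the ordering is perfectly preserved, while Pearson's r stays below 1 because the relationship is not linear.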
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
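The pair-counting definition can be sketched directly. The function below implements the simple form described here (often called tau-a), in which tied pairs count as neither concordant nor discordant; the name `kendall_tau_a` is illustrative, and tie-adjusted variants such as tau-b may give different values on data with many ties.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The sign of the product tells whether both variables move the same way.
        s = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
        # s == 0 is a tie in at least one variable and is not counted.
    return (concordant - discordant) / len(pairs)

# One swapped pair out of six: 5 concordant, 1 discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```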
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.
First rows
| | Unnamed: 0 | Locality | Type of property | Price | Number of rooms | Area | Fully equipped kitchen | Furnished | Open fire | Terrace Area | Garden Area | Surface of the land | Number of facades | Swimming pool | State of the building | Province | Region | PriceperMeter |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2970 | apartment | 764999.0 | 2.0 | 153.0 | 0.0 | 0.0 | 0.0 | 62.0 | 0.0 | 215.0 | 2.0 | 0.0 | medium | Anvers | Flanders | 4999.0 |
| 1 | 1 | 3200 | apartment | 294999.0 | 2.0 | 80.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 80.0 | 2.0 | 0.0 | medium | Brabant Flamand | Flanders | 3687.0 |
| 2 | 2 | 8211 | apartment | 233999.0 | 2.0 | 90.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 90.0 | 2.0 | 0.0 | medium | Flandre Occidental | Flanders | 2599.0 |
| 3 | 3 | 2630 | apartment | 329899.0 | 1.0 | 87.0 | 0.0 | 0.0 | 0.0 | 28.0 | 0.0 | 115.0 | 2.0 | 0.0 | medium | Anvers | Flanders | 3791.0 |
| 4 | 4 | 2630 | apartment | 359899.0 | 1.0 | 95.0 | 0.0 | 0.0 | 0.0 | 47.0 | 0.0 | 142.0 | 4.0 | 0.0 | medium | Anvers | Flanders | 3788.0 |
| 5 | 5 | 4432 | apartment | 248999.0 | 3.0 | 125.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 125.0 | 2.0 | 0.0 | medium | Liège | Wallonia | 1991.0 |
| 6 | 6 | 4432 | apartment | 412299.0 | 2.0 | 125.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 125.0 | 4.0 | 0.0 | medium | Liège | Wallonia | 3298.0 |
| 7 | 7 | 9300 | apartment | 144999.0 | 1.0 | 70.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 70.0 | 2.0 | 0.0 | medium | Flandre Oriental | Flanders | 2071.0 |
| 8 | 8 | 9300 | apartment | 238999.0 | 1.0 | 47.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 47.0 | 2.0 | 0.0 | medium | Flandre Oriental | Flanders | 5085.0 |
| 9 | 9 | 9300 | apartment | 129999.0 | 1.0 | 67.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 67.0 | 2.0 | 0.0 | medium | Flandre Oriental | Flanders | 1940.0 |
Last rows
| | Unnamed: 0 | Locality | Type of property | Price | Number of rooms | Area | Fully equipped kitchen | Furnished | Open fire | Terrace Area | Garden Area | Surface of the land | Number of facades | Swimming pool | State of the building | Province | Region | PriceperMeter |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 59344 | 59606 | 4600 | house | 511376.0 | 3.0 | 203.0 | 0.0 | 0.0 | 0.0 | 49.0 | 0.0 | 252.0 | 4.0 | 0.0 | medium | Liège | Wallonia | 2519.0 |
| 59345 | 59607 | 8610 | house | 257211.0 | 3.0 | 134.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 134.0 | 3.0 | 0.0 | medium | Flandre Occidental | Flanders | 1919.0 |
| 59346 | 59608 | 7880 | house | 332200.0 | 3.0 | 170.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 170.0 | 4.0 | 0.0 | medium | Hainaut | Wallonia | 1954.0 |
| 59347 | 59609 | 7880 | house | 334900.0 | 3.0 | 165.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 165.0 | 4.0 | 0.0 | medium | Hainaut | Wallonia | 2029.0 |
| 59348 | 59610 | 7880 | house | 340500.0 | 3.0 | 167.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 167.0 | 4.0 | 0.0 | medium | Hainaut | Wallonia | 2038.0 |
| 59349 | 59611 | 8902 | house | 307242.0 | 3.0 | 150.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 150.0 | 3.0 | 0.0 | medium | Flandre Occidental | Flanders | 2048.0 |
| 59350 | 59612 | 9600 | house | 315000.0 | 3.0 | 150.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 150.0 | 3.0 | 0.0 | good | Flandre Oriental | Flanders | 2100.0 |
| 59351 | 59613 | 9600 | house | 315000.0 | 3.0 | 150.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 150.0 | 3.0 | 0.0 | good | Flandre Oriental | Flanders | 2100.0 |
| 59352 | 59614 | 6000 | house | 175000.0 | 4.0 | 205.0 | 0.0 | 0.0 | 0.0 | 23.0 | 600.0 | 828.0 | 2.0 | 0.0 | medium | Hainaut | Wallonia | 853.0 |
| 59353 | 59615 | 6000 | house | 185000.0 | 4.0 | 200.0 | 0.0 | 0.0 | 0.0 | 23.0 | 800.0 | 1023.0 | 3.0 | 0.0 | medium | Hainaut | Wallonia | 925.0 |
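Two columns in these sample rows look derived from others, which would explain several of the high-correlation warnings. The sketch below tests two assumptions inferred only from the rows shown, not from any documented definition: that PriceperMeter is Price divided by Area, rounded down, and that Surface of the land is the sum of Area, Terrace Area and Garden Area. The function names are hypothetical.

```python
from math import floor

# (Price, Area, Terrace Area, Garden Area, Surface of the land, PriceperMeter)
# copied from rows 0, 1 and 59352 of the tables above.
rows = [
    (764999.0, 153.0, 62.0, 0.0, 215.0, 4999.0),
    (294999.0, 80.0, 0.0, 0.0, 80.0, 3687.0),
    (175000.0, 205.0, 23.0, 600.0, 828.0, 853.0),
]

def price_per_meter(price, area):
    """Assumed derivation: price per square metre, rounded down."""
    return float(floor(price / area))

def land_surface(area, terrace, garden):
    """Assumed derivation: living area plus terrace and garden areas."""
    return area + terrace + garden

for price, area, terrace, garden, surface, ppm in rows:
    print(price_per_meter(price, area) == ppm,
          land_surface(area, terrace, garden) == surface)
```

If these relations hold across the whole dataset, the derived columns carry no information beyond their inputs and could be dropped before modelling.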